Computer and Modernization ›› 2012, Vol. 1 ›› Issue (1): 34-36,8.doi: 10.3969/j.issn.1006-2475.2012.01.009

• 人工智能 • Previous Articles     Next Articles

An XML Retrieval Document Similarity Algorithm Based on Bayesian Classifier

HAN Xiao-mei, ZHENG Hong-yuan, DING Qiu-lin   

  1. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
  • Received:2011-09-09 Revised:1900-01-01 Online:2012-01-10 Published:2012-01-10

Abstract: At present, the similarity calculation for inquires is usually considered by comparing retrieval results to inquires. This paper proposes an algorithm based on Bayesian classifier to calculate the similarity of XML search results. On the basis of working out similarity of each document and inquire, it divides XML retrieval documents into relevant sets and uncorrelated sets by using Bayesian classifier. Then, final similarity is obtained by calculating the similarity of relevant documents and uncorrelated documents. At last, the experimental analysis shows that the new algorithm improves the retrieval performance effectively about 15 percent higher than traditional method without affecting recall ratio.

Key words: Bayesian classifier, inquire similarity, XML retrieval document, information retrieval

CLC Number: